⚡ SIMD Optimization - matmat · Scour

🚀SIMD Parsing zeux.io·

Zigzag decoding with AVX-512

Covers uops.info

Discussed on Hacker News

🧮Compute Optimization shnatsel.medium.com·

Safe SIMD in Rust, even on the inside

Discussed on Hacker News and Lobsters

🔒Type Safety Phoronix·

Rust PNG Image Decoder Now Even Faster: Benefiting Chrome, GNOME, Etc

⚡Parallel Computing indianspeedster.github.io·

Occupancy Math on the AMD MI355X: A From-First-Principles Guide

Discussed on Hacker News, Hacker News, and Hacker News

Less-relevant results

🚀SIMD Text Processing extractingcycles.com·

The IPv4 Parser AI Couldn't Have Written

Covers Compiler Explorer

Discussed on Hacker News

🎯Retrieval Systems arxiv.org·

MonaVec: A Training-Free Embedded Vector Search Kernel for Edge and Offline AI Systems

Covers Easy way to do both: async <-> sync (crates.io dump loading and parsing example)

⚡SIMD Vectorization Tom's Hardware

·

Intel and AMD's new ACE CPU extensions bring an efficient AI-oriented instruction set to x86 — a new design makes matrix multiplication more power- and density-...

Covered by 3 sources including Ambient Irony, acecomments.mu.nu

🔒Type Safety blog.image-rs.org·

Rust PNG crate gets even faster, used by GNOME and Chromium

Covers google/oss-fuzz

Covered by Phoronix

Discussed on Hacker News

🧩RISC-V Assembly atticarun.itch.io·

Foundry-5: browser puzzle game that teaches you real RISC-V assembly

Discussed on Hacker News

🚀Compiler Optimizations hiraditya.github.io·

Loop Unrolling in the ML Era

Discussed on Hacker News

🧮Compute Optimization Sylvain Kerkour·

Hashing at 130 GB/s with XXH3, Rust and SIMD instructions on AMD Zen 5

Discussed on Hacker News

📡Network Protocol Design cr.yp.to blog·

EuroQCI feedback

Covers 2 stories including Four Russian satellites are now within striking distance of an ICEYE radarsat

Covered by Techrights

📊Vector Quantization GitHub·

RunEdgeAI/turboquant.cpp: Near-optimal online vector quantization in C++23 — 1-4 bits per coordinate, no training, no codebooks

Covers TurboQuant: Online Vector Quantization with Near-optimal Distortion Rate

Discussed on Hacker News

⚡Parallel Computing Akin Ocal·

Building a High-Throughput FIX Server

Discussed on Substack

🖥️Modern CPU Phoronix·

Revised AVX-512 xor_gen() Implementation For Linux RAID Yielding More Performance Gains

📊Frequency Analysis arxiv.org·

Not Your Usual FFT: QFT$\rightarrow$FFT via Classical Quantum-Circuit Simulation

📐Linear Algebra arxiv.org·

Evaluating Rust for Sparse Matrix Kernels in Scientific Computing

⚡Parallel Computing arxiv.org·

Diagonal-Budgeted Trotterization for Efficient Quantum Hamiltonian Simulation

🧠Machine Learning arxiv.org·

Experimental Analysis of Neural Network-Based Image Classification on the CIFAR-10 Dataset

🛡️RISC-V Security arxiv.org·

Is RISC-V Ready for Massively Parallel Astrophysical Codes?

No more posts from matmat's subscribed feeds.

Scour all 25,324 feeds Learn more about Feeds

Log in to enable infinite scrolling